A model for correlations in stock markets 
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We propose a group model for correlations in stock markets. In the group model the markets are 
composed of several groups, within which the stock price fluctuations are correlated. The spectral 
properties of empirical correlation matrices reported in [Phys. Rev. Lett. 83, 1467 (1999); Phys. 
Rev. Lett. 83, 1471 (1999.)] are well understood from the model. It provides the connection 
between the spectral properties of the empirical correlation matrix and the structure of correlations 
in stock markets. 
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There has been growing interest of physicists in eco- 
nomic systems. They exhibit various interesting com- 
plex behaviors, e.g., a power-law distribution of the vari- 
ations of financial market indices , and many efforts 
have been devoted to understanding the temporal corre- 
lations [0-^1. The economic systems are composed of a 
large number of interacting units. The information of the 
correlations and interactions is provided by a correlation 
matrix . Its matrix elements are given by the overlap 
between price changes of different assets. They measure 
a extent to which the price fluctuations of different assets 
are correlated. 

Recently Laloux et al. |jj and Plerou et al. @] studied 
spectral properties of the empirical correlation matrix C 
of stock price changes in real markets. Comparing the 
empirical data with the theoretical prediction of the ran- 
dom matrix theory, they observed a correlated aspect of 
the financial markets. However, the nature of the corre- 
lations is not revealed yet. In this work, we propose that 
the observed spectral properties reflect a specific struc- 
ture of correlations in the financial markets. To this end, 
we postulate a model for the correlations and show that 
it explains all aspects of the observed spectral properties. 

Following the notation of , the empirical correlation 
matrix is constructed from the time series of stock price 
changes 6xi(i), where i (i = 1, . . . , N) labels the assets 
and 1 < t < T the time. It is assumed that Sx's are 
shifted and rescaled to have zero mean and a unit vari- 
H| . The correlation matrix has elements 
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(Sxi) ■ (Sxj) 



(1) 



Here we introduced a symbolic notation that x ■ y — 

£eL *(*)*(*)■ 

Suppose that there exist only random correlations: 
Sxi(t) are independent identically-distributed random 
variables. In that case the distribution of eigenvalues 
A of C is exactly known in the limit N, M — > oo with 
Q = T/N > 1 fixed Jh}. The density of the eigenvalues 
is given by 



for A G [A m i n ,A max ] and zero otherwise, where A max = 
(1 + l/y/Q) 2 and A m in = (1 — l/\/Q) 2 . Deviations from 
Eq. (||) provide clues about the structure of the financial 
markets and the nature of the underlying interactions. 

Here we briefly summarize the finding of Ref. 
They constructed the correlation matrix of N assets from 
the major stock markets and calculated the density of 
eigenvalues, p(A), and the inverse participation ratio, I. 
It is related to the reciprocal of the number of eigenvector 
components significantly different from zero, and defined 



as 
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where k (k = 1, . . . , N) refers to the eigenstate and um 
is the lih components of its normalized eigenvector. It is 
observed that p(A) follows the distribution in Eq. (Q) for 
small A < 2.0 Those low-lying eigenstates reflect a 

random noisy part of the correlation. Besides, it is also 
observed that there appear several isolated eigenstates 
for large A > 2.0 [0J^] and that the eigenstates at both 
ends of the spectrum have large value of I ||. However, 
an appropriate explanation for these is not presented. 

To explains the observed spectral properties, we pro- 
pose a model for the correlations, which will be called 
the group model. In the group model it is assumed that 
the stock price changes of companies in the same kind 
of industry have a strong correlation. It is in agreement 
with our everyday experiences and a quantitative study 
supports this 0] . Those sets of companies will be called 
the groups. The group feature is incorporated into the 
model by assuming that the stock price change §Xi(t) of 
a company i (= 1, . . . , N) at time t (= 1, . . . , T) is given 
by a sum of two random variables: 



SXi(t) = f ai (t)+€i(t) , 



(4) 



P(A) 



Q \J (Amax — A) (A — A m i n ) 

2t7 A 



(2) 



where cv^ denotes the group to which a company i be- 
longs, f ai represents a synchronous variation of the group 
OH, and £i a random noisy part. The noise strength is 
parametrized by e ai with which the variances of f ai and 
d are given by (1+£q.) -1 and e 2 a . respectively. 
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The correlation matrix takes the form of C = C° + C 1 
with Cfj = S auCtj (S a ,p ■ the Kronecker delta symbol). 

When e a = and T is infinite, C = C° becomes a 
block diagonal matrix; a direct product of N a x N a ma- 
trices with all elements being unity (N a : the size of the 
group a). Each block has a nondegenerate eigenstate 
with the eigenvalue A = N a and (N a — l)-fold degenerate 
eigenstates with A = 0. The inverse participation ratio of 
the nondegenerate eigenstates is the inverse of the group 
size, I — l/N a . 

For nonzero e a and finite T, the random overlap part 
C 1 becomes nonzero. Its effect can be treated as a per- 
turbation to C°. Up to the first order perturbation, the 
eigenvalues of the nondegenerate states are scaled down 
by a factor (1 + £^) _1 and the degenerate states split 
into the states whose eigenspectrum are given by those 
of C 1 . Therefore, the perturbed degenerate eigenstates 
with small A will result in the distribution of the eigen- 
values of Eq. (||), while the nondegenerate states isolated 
peaks at A ~ (1 + e 2 a )~ l N a as found in 

The information of the correlation is mainly contained 
in the isolated eigenstates. Each one represents a corre- 
lated group, whose size and participating companies are 
obtained from the eigenvector and its eigenvalue. The 
eigenstates with small eigenvalues, which are approxi- 
mately the eigenstates of C 1 , may also provide the corre- 
lation information. Since C 1 generates random overlaps 
between groups, its eigenvector components spread over 
the whole market in general. However, the overlap may 
be confined mainly within the group, provided that the 
noise strength of a group is small. Then there may appear 
an eigenstate with the eigenvalue near the unperturbed 
value, zero, and with relatively large inverse participation 
ratio, as observed in |H|. 

We demonstrate the validity of the group model by 
examining the spectral properties of an artificial corre- 
lation matrix constructed from a time series of Eq. (^) 
with N — 254 and T — 800. A part of them is separated 
arbitrarily into groups of size N a = 2, 4, 8, 16, 32, 64 and 
the remaining ones are assumed to behave independently. 
The noise strength is assigned to e a = (1 — l/N a ). Fig- 
ure [j] shows p(X), ensemble-averaged over 50 realizations 
of the time series, in (a) and the inverse participation ra- 
tio I for a realization in (b) . They are plotted in the log- 
log scale to show the large and small A regions on an equal 
footing. They are in excellent agreement with the empir- 
ical data shown in |t],||]. p(X) for A < 2.0 is well fitted to 
Eq. (g). It deviates from Eq. (§) for A > 2.0. The iso- 
lated peaks represents the groups of size N a = 4, . . . , 64. 
A peak for the group of N a — 2 is smeared with the con- 
tinuous distribution. The inverse participation ratio is 
large at both ends of the spectrum. For large A, A and 
/ are inversely proportional to each other since A is pro- 
portional to N a , and I to l/N a . The eigenstate with the 
smallest eigenvalue has also large value of /. It has large 
components within the group of size N a = 2, which has 



smallest noise strength. 

In summary, we proposed the group model to explain 
the spectral properties of the empirical correlation ma- 
trices. According to the group model, the stock markets 
are composed of several correlated groups of companies. 
The stock price fluctuations of companies within a group 
have a synchronous component. The groups manifest 
themselves as the eigenstates of C with large eigenval- 
ues. The sizes of groups and composing companies are 
known from the eigenvalues and the eigenvectors. It will 
be an interesting work to reveal the group structure of 
the real markets from the empirical correlations matrix. 
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FIG. 1. (a) p(X) (x) and a fitting curve to the form in 
Eq. (g) (solid line), (b) Inverse participation ratio with a plot 
of 1/A against A (dotted line). 
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